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Draft Genome Sequence of Uropathogenic Escherichia coli Strain J96 
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Escherichia coli J96 (04:K6) was isolated from a human pyelonephritis patient. Here, we report the draft genome sequence of 
E. coli J96, which contains virulence genes, including adhesion factors, alpha-hemolysins, and cytotoxic necrotizing factor. J96 
infects the kidney and bladder, making it an important tool for studying E. coli pathogenesis. 
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Uropathogenic Escherichia coli (UPEC) strains are the most 
common bacteria that cause urinary tract infections. UPEC 
strains differ primarily by the presence of various virulence fac- 
tors, often encoded on pathogenicity islands (PAIs). E. coli strain 
J96 was isolated from a human pyelonephritis patient (1). Several 
studies have characterized J96 virulence factors, including ad- 
hesins (pap, prs, foe, and fim), alpha-hemolysins {hly), and the 
cytotoxic necrotizing factor (cm/) (2, 3). In addition to chromo- 
somal pathogenicity factors, J96 carries an ~115-kb plasmid 
(pJ96) (4) . The variety of virulence genes, together with the ability 
of J96 to infect bladder and kidney tissues, makes this strain a 
valuable tool for studying UPEC pathogenesis. A previous work 
produced a physical-genetic map of the J96 genome (4), as well as 
the sequences of several of pathogenic factors (5,6). In this report, 
the fuU chromosome and plasmid sequences were subjected to 
genomic analysis. 

Three paired-end libraries were prepared: (i) total DNA (200 
bp), (ii) total DNA (500 bp), and (iii) plasmid DNA (200 bp). The 
genome sequence was determined using shotgun sequencing 
(read length, 101 bp) on an lUumina HiSeq 2000 (library 1, 
116,959,856 reads and 2,300X coverage; library 2, 85,721,146 
reads and 1,690 X coverage; plasmid, 30,608,372 reads and 
26,800 X coverage). The reads were quality-filtered and randomly 
sampled to achieve 100 X coverage for the assembly. The plasmid 
sequence was assembled using Edena v3 (7), resulting in 14 con- 
tigs. Plasmid contaminants in the total DNA libraries were re- 
moved by aligning reads to the plasmid contigs using Bovrtie (8). 
The chromosomal reads were assembled in two phases. De novo 
assembly was done with Edena v3 (7), Assembly by Short Se- 
quences (ABySS) vl.3.1 (9), and Velvet vl. 2 (10). Reads were also 
ahgned to UPEC strains UTI89 (accession no. NC_007946), 
CFT073 (accession no. NC_004431) (11), and EC536 (accession 
no. NC_008253). The de novo assemblies and alignment-based 
contigs were merged using Gap4 (12) and scaffolded with SSAKE- 
based Scaffolding of Pre-Assembled Contigs after Extension 
(SSPACE) v2.0 (13), and gaps were PCR-amplified and sequenced 
by Sanger sequencing. The assembly of chromosomal sequences 
resulted in 262 contigs, which were annotated via the NCBI Pro- 
karyotic Genome Annotation Pipeline (PGAAP) (14). The J96 
chromosome is 5.51 Mb and its G-HC content is 51.1%. pJ96 is 
1 12 kb and its G-l-C content is 50.9%. The chromosome and plas- 



mid contain 5,259 and 140 protein-coding sequences, respec- 
tively. The chromosome also contains 28 rRNA and 105 tRNA 
genes. 

Phylogenetic analysis based on average nucleotide index 
(ANI) values (15) among E. coli strains showed that J96 is most 
similar to other UPEC strains. At least three PAIs were identified: 
PAI-I (pap), PAI-II (hly, prs, cat-1 [chloramphenicol resistance], 
and fim), and PAI-III (/oc). Additionally, the genome encodes 
numerous transposases and insertion sequences, highlighting 
the importance of genomic exchange in creating the J96 patho- 
genic phenotype. 

Alignment of pJ96 to other UPEC plasmids revealed high sim- 
ilarity to pUTI89 (accession no. NC_007941), though pJ96 lacked 
several hypothetical proteins (P018-020 and P084-085) and con- 
tained 6 single nucleotide polymorphisms (SNPs). 

Nucleotide sequence accession numbers. E. coli strain J96 can 
be obtained from ATCC (no. 700336). This Whole Genome Shot- 
gun project has been deposited at DDBJ/EMBL/GenBank under 
the accession no. ALINOOOOOOOO. The version described in this 
paper is the first version, ALIN02000000. 
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